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Abstract 

Genomes of the plant-pathogenic genus Phytophthora are characterized by small duplicated blocks consisting of two consecutive 
genes (2HOM blocks) and by an elevated abundance of similarly aged gene duplicates. Both properties, in particular the presence of 
2HOM blocks, have been attributed to a whole-genome duplication (WGD)atthe last common ancestor of Phytophthora. However, 
large intraspecies synteny — compelling evidence for a WGD — has not been detected. Here, we revisited the WGD hypothesis by 
deducing the age of 2HOM blocks. Two independent timing methods reveal that the majority of 2HOM blocks arose after divergence 
of the Phytophthora lineages. In addition, a large proportion of the 2HOM block copies colocalize on the same scaffold. Therefore, the 
presence of 2HOM blocks does not support a WGD at the last common ancestor of Phytophthora. Thus, genome evolution of 
Phytophthora is likely driven by alternative mechanisms, such as bursts of transposon activity. 
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Phytophthora: a Paleopolyploid 
Genus? 

Whole-genome duplications (WGDs) are powerful facilitators 
of evolution, because they generate large amounts of raw 
genetic material. By duplicating the entire set of genes, 
WGDs pave the way for neo-, subfunctionalization, and recip- 
rocal gene loss (Scannell et al. 2006; Semon and Wolfe 2007). 
Therefore, WGDs have been associated with important evo- 
lutionary events such as extensive radiation and adaptation 
towards changing environments (Christoffels et al. 2004; 
Young et al. 201 1). Ancient WGDs have been observed in a 
variety of eukaryotic lineages, including vertebrates, flowering 
plants, and the unicellular ciliates (Dehal and Boore 2005; 
Aury et al. 2006; Amborella Genome Project 2013). Such 
paleopolyploids are often identified by the presence of in- 
traspecies synteny in addition to many duplicated genes that 
arose at the same time (Van de Peer 2004). Recently, it has 
been suggested that a WGD occurred before the speciation of 
several extant Phytophthora (Martens and Van de Peer 2010), 



a genus of important plant-infecting eukaryotic microbes 
(Jiang and Tyler 2012). 

The genus Phytophthora encompasses more than a hun- 
dred species that infect numerous plant lineages causing dev- 
astating diseases, such as potato late blight (Phytophthora 
infestans) and sudden oak death (P. ramorum) (Govers and 
Gijzen 2006; Tyler et al. 2006; Haas et al. 2009; Kroon et al. 

201 2) . The extensive radiation within this genus, accompanied 
by adaptation of its members to novel environments and dis- 
tinct hosts specificities, makes a WGD in the last common 
ancestor of Phytophthora a compelling scenario. Moreover, 
polyploidy has been reported in some extant Phytophthora 
species, highlighting the potential influence of this process 
in the evolution of this genus (Sansome 1977; Bertier et al. 

2013) . Recently, small duplicated blocks consisting of two di- 
rectly adjacent genes (termed 2HOM blocks) were found to be 
abundant in Phytophthora (Martens and Van de Peer 2010). 
This genomic feature has been attributed to an ancient WGD 
in Phytophthora succeeded by extensive genomic rearrange- 
ments. Moreover, a WGD provides a simple explanation for 
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the increased gene duplication frequency at the Phytophthora 
ancestor (Martens and Van de Peer 2010; Seidl et al. 2012). 

However, small-scale duplications cannot be excluded as a 
possible driving force for the appearance of 2HOM blocks, 
especially because no larger regions of intraspecies synteny 
could be detected (Haas et al. 2009; Martens and Van de 
Peer 2010). Moreover, in-depth re-examination of the 
2HOM blocks revealed several features that challenge the 
WGD hypothesis. For example, many 2HOM blocks have 
their copies located on the same scaffold or contain transpos- 
able elements (TEs) (supplementary table S1, Supplementary 
Material online), thus most likely reflecting segmental dupli- 
cations and transposition, respectively. 

Because 2HOM blocks nevertheless represent intraspecies 
synteny in the Phytophthora genomes, widely considered 
the most important indicator of WGDs, we uncovered their 
evolutionary origin. We applied two independent methods to 
estimate the duplication age of 2HOM blocks. First, we deter- 
mined the timing of 2HOM block duplication by a combina- 
tion of presence/absence patterns and phylogenetic analysis. 
Second, we used synonymous substitutions (K s ) as a proxy for 
duplication time. We established that the majority of 2HOM 
blocks emerged very recently, following the radiation of 
Phytophthora. Consequently, the presence of 2HOM blocks 
is not a remnant of a WGD in the last common ancestor of 
Phytophthora. 

The Origin of 2HOM Blocks Revisited 

To infer homology within and across genomes of six divergent 
Phytophthora lineages and distantly related oomycetes (fig. 1) 
(Tyler et al. 2006; Haas et al. 2009; Baxter et al. 2010; 
Levesque et al. 2010; Lamour et al. 2012; Jiang et al. 2013), 
we clustered all predicted proteins based on sequence similar- 
ity (see supplementary materials and methods, Supplementary 
Material online). Using the resulting gene families, we identi- 
fied 2HOM blocks in the Phytophthora species and subse- 
quently screened the other oomycetes for copies of these 
2HOM blocks. TEs are common in Phytophthora genomes, 
and their predicted gene content contains many TE remnants 
(Haas etal. 2009; Seidl etal. 201 1). To exclude blocks in which 
one or both constituent(s) could have resulted from transpo- 
sition rather than duplication, we filtered TE -containing 
2HOM blocks (supplementary Materials and Methods, 
Supplementary Material online, and table 1). 

We observed between 98 (P. capsici) and 244 (P. infestans) 
2HOM blocks in the Phytophthora genomes (table 1). The 
numbers of 2HOM blocks in the genomes of P. infestans 
and P. ramorum are comparable to those that have been 
found before (Martens and Van de Peer 2010). However, 
we identified fewer blocks in P. sojae (131 vs. 207). 
Variations between both analyses are likely caused by our 
more stringent TE filtering and by differences in determining 
gene families (see supplementary table S1, Supplementary 



single-gene and small-scale duplications 




Phytophthora parasitica | 

Clade 1 

Phytophthora infestans I 
Phytophthora capsici I Clade 2 
Phytophthora sojae I 

Clade 7 

Phytophthora cinnamomi 
Phytophthora ramorum | Clade 8 
Hyaloperonospora arabidopsidis 
Pythium ultimum 
Saprolegnia parasitica 



Fig. 1. — Species tree, as previously established by Blair et al. (2008), 
with gene (and small-scale) duplication frequency indicated. The gene 
duplication frequency is elevated at the last common ancestor of 
Phytophthora and in some extant Phytophthora lineages. 



Material online, for TE content in the previously published 
2HOM blocks and supplementary table S2, Supplementary 
Material online, for a comparison of both 2HOM block sets). 

Surprisingly, a high fraction of all 2HOM blocks identified 
(29%) has at least two copies located on the same scaffold 
(table 1). For example, both copies of the P. infestans 2HOM 
block containing a vesicle transporter gene and a riboflavin 
biosynthesis gene are situated on supercontig 15 (fig. 2A). 
Such a high fraction of colocalization — up to 47% in P. para- 
sitica and P. sojae — is not expected if blocks are derived from a 
WGD; those are more likely to be localized on different chro- 
mosomes or scaffolds. Although genomic rearrangements 
could lead to colocalization of some WGD-derived homolo- 
gous blocks, reshuffling seems unlikely to account for all in- 
stances, especially considering the high number of scaffolds in 
Phytophthora genome assemblies (between 708 and 4,921). 
Moreover, the number of colocalizing 2HOM blocks in each 
species (excluding P. cinnamomi) is significantly higher than 
expected by random localization (P value < 0.0001; supple- 
mentary Materials and Methods, Supplementary Material 
online). Therefore, the colocalized 2HOM blocks are more 
likely the products of segmental duplication events than of 
an ancient WGD. 

To precisely time the origin of 2HOM blocks in different 
Phytophthora genomes, we categorized each duplication 
event that creates a 2HOM block as "private" (species- 
specific) or "shared" (ancestral). A 2HOM duplication is pri- 
vate if it occurred more recently than any speciation, and it is 
shared if it precedes a speciation between at least two differ- 
ent Phytophthora species. To categorize the duplications, we 
inferred the timing of duplications using two complementary 
methods. The first method is based on presence: If a 2HOM 
block only occurs in a single Phytophthora species, that is, not 
a single copy of this block was detected in any of the other 
genomes, we parsimoniously inferred the number of duplica- 
tions which created the copies and categorized these as "pri- 
vate. " The second method consists of phylogenetic timing: If a 
2HOM block has copies in multiple species, we inferred sep- 
arate phylogenetic trees for the two genes that form this 
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Table 1 

Numbers of 2H0M Blocks in the Examined Genomes 



Species 


2HOM 


2HOM Blocks 


Categorization of 2HOM Block Duplications 




Blocks 


Filtered 










Same Scaffold 


Private 


Shared 








Duplications 


Duplications 


P. capsici 


474 


98 36 (P<0.0001) 


78 (77%) 


23 


P. cinnamomi 


609 


183 0 


148 (80%) 


37 


P. infestans 


396 


244 70 (P<0.0001) 


227 (93%) 


16 


P. parasitica 


255 


104 49 (P<0.0001) 


61 (74%) 


21 


P. ramorum 


252 


145 43 (P<0.0001) 


112 (84%) 


22 


P. sojae 


282 


131 62 (P<0.0001) 


99 (80%) 


25 



Note. — The 2HOM blocks were filtered for large families and for TEs. The number of filtered 2HOM blocks with copies on the same 
scaffold was assessed for significance by genome randomization. Of each filtered 2HOM block, the timing of underlying duplication(s) was 
inferred either from phylogenetic trees or from its single-species occurrence and categorized accordingly. 



particular block and interpreted these according to the species 
tree (fig. 1, see also supplementary Materials and Methods, 
Supplementary Material online). 

Two examples demonstrate the phylogenetic timing of 
2HOM duplications (fig. 2). Figure 2A depicts a P. infestans 
2HOM block that has copies in five additional species. The 
duplication that gave rise to this 2HOM block is species- 
specific (private), and thus this 2HOM block cannot be the 
result of any ancestral duplication, neither segmental nor 
whole-genome. In contrast, the genes encoding short chain 
dehydrogenases and 1 2-oxophytodienoate reductases consti- 
tute a 2HOM block with a more complex evolutionary history 
(fig. 26). In P. ramorum, P. sojae, and P. capsici, the 2HOM 
block is present due to a shared duplication. In P. infestans, 
this 2HOM block is formed by a private duplication. Even 
though this block has, at least for three Phytophthora lineages, 
a shared ancestry, both duplicate copies are located on the 
same scaffold in their genomes and therefore is likely not a 
WGD remnant (see discussion above). Evaluating both 
approaches (single-species occurrence and phylogenetic tree 
interpretation), the majority (across all species 725 out of 869, 
at least 74% in individual species) of duplications that gave 
rise to 2HOM blocks was inferred to be private (table 1). 

Interestingly, the relatively young age of 2HOM block du- 
plications revealed by phylogenetics and presence is not in 
accordance with the age of 2HOM blocks observed by 
Martens and Van de Peer (2010). Their /C s -based timing re- 
vealed a large fraction of 2HOM blocks originating at approx- 
imately the same time before speciation, indicated by a peak 
in the distribution at higher K s . If duplication rate is relatively 
constant, one expects to see an exponential decay in the K s 
distribution, with a single peak at low K s constituted by recent 
duplications that have not been subject to degradation yet. 
A second peak at higher K s values, as demonstrated by 
Martens and Van de Peer (2010), suggests that on top of a 
relatively constant duplication rate, in the past a high amount 
of duplications occurred in a very short time interval. Thereby, 
this peak is potentially indicative of a WGD, as it might be the 



result of a single duplication event (Lynch and Conery 2000; 
Blanc and Wolfe 2004). We similarly assessed the age of our 
2HOM blocks by calculating K s of the paralogous gene pairs 
constituting these blocks. The /C s -inferred age of duplications 
confirms the duplication categorization, because most para- 
logs constituting 2HOM blocks have lower K s values than 
orthologs, indicating duplications that occurred more recently 
than speciation (fig. 3/4). To further delineate the differences 
between the two outcomes, we analyzed the original data of 
Martens and Van de Peer (201 0). In-depth reanalyses of these 
data revealed significant inconsistencies in 2HOM block para- 
log definition and thus K s inference. Once correcting for these 
inconsistencies, their primary data yield K s distributions point- 
ing to a more recent origin of the 2HOM blocks, similar to the 
results of this study (supplementary fig. S2, Supplementary 
Material online). Thus, we conclude that most 2HOM blocks 
are of recent, species-specific origin. We cannot exclude the 
possibility that some of the very young 2HOM blocks are in 
fact alleles and not duplicates (e.g., fig. 24). However, due to 
the absence of a substantial number of ancient 2HOM blocks, 
the 2HOM blocks do not support a WGD in the last common 
ancestor of Phytophthora. 

High Frequency of Gene Duplications 
in Phytophthora Ancestor 

In addition to the 2HOM blocks, a second WGD indicator is 
the observed high frequency of gene duplications at the last 
common ancestor of Phytophthora (Martens and Van de Peer 
2010; Seidl et al. 2012). We investigated whether this pro- 
posed increase in gene duplications persists when including 
additional Phytophthora genomes and applying a more strin- 
gent TE filtering. 

We calculated K s for the Phytophthora paranomes, that is, 
the set of all paralogous gene pairs within a genome (fig. 36 
and supplementary fig. S1, Supplementary Material online). 
We observed a small second peak that deviates from the ex- 
ponential decay in most Phytophthora genomes, therefore 
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Fig. 2. — Two examples of phylogenetically analyzed 2HOM blocks. For both 2H0M blocks, phylogenetic trees of the genes and an illustration of the 
positions on the scaffolds are given. Bootstrap supports based on 100 replicates are displayed on the branches. (A) A Phytophthora infestans 2HOM block, 
likely the result of a species-specific, possibly segmental duplication, is displayed. (6) A 2HOM block of P. infestans (species-specific duplication), P. ramorum, 
P. sojae, and P. capsici (derived from a shared, possibly segmental duplication) has a more complex duplication history. 



indicating a slightly increased duplication frequency before the 
radiation of Phytophthora. Martens and Van de Peer (2010) 
initially showed a more pronounced second peak in the para- 
nome K s distributions. The observed inconsistency is largely 



due to differences in the calculation of equilibrium codon fre- 
quencies, which one has to estimate to calculate K s (see sup- 
plementary figs. S3-S6, Supplementary Material online, for a 
comparison of K s distributions of P. infestans, P. ramorum, and 
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Fig. 3.— K s distributions of Phytophthora infestans. (A) 2HOM block 
paralogs and (6) full paranome. Peaks in the distributions of orthologous 
sets of gene pairs indicate speciation events. Dashed lines indicate exam- 
ples of distributions following a pattern of exponential decay. Full para- 
nome distributions of the other Phytophthora spp. are available in 
supplementary figure S1, Supplementary Material online. 



P. sojae under different models of codon frequencies). The 
method applied by Martens and Van de Peer (2010) assumes 
that all codons are equally likely to be part of coding sequence. 
It therefore ignores codon bias, which is a prominent phenom- 
enon in Phytophthora (Jiang and Govers 2006; Tripathy and 
Tyler 2006). Instead, we applied a method that estimates the 
codon frequencies based on the average nucleotide frequen- 
cies at the three codon positions and thus accommodates 
codon bias (Yang and Nielsen 1998) yielding more reliable 
K s distributions. 

To further delineate the precise point of gene duplication 
and to trace gene duplication in spite of subsequent gene 
losses, we applied phylogenetic timing to all gene families, 
like we did for the 2HOM blocks. We constructed phyloge- 
netic trees for the corresponding protein families and auto- 
matically reconciled them with the species tree similar to Seidl 
et al. (2012). This analysis revealed very high numbers of du- 
plications in some of the extant Phytophthora species, and also 
a burst of gene duplications in the Phytophthora common 



ancestor (fig. 1 and supplementary fig. S7, Supplementary 
Material online). This relative abundance of gene duplications 
persists even if we account for possible wrongly inferred tree 
topologies by excluding ancestral duplications of which none 
of the extant species retained both copies (Vilella et al. 2009) 
(supplementary fig. S7, Supplementary Material online). Given 
this requirement of gene copy retention, the last common 
ancestor of P. infestans, P. parasitica, and P. capsici only expe- 
rienced 281 gene duplications, whereas the Phytophthora an- 
cestor and the today-living lineage P. infestans had 1,651 and 
1,353 gene duplications, respectively. 

At a first glance, the full paranome K s distributions and 
results from the tree reconciliation seem contradictory. 
However, in contrast to tree reconciliation, /C s -based age esti- 
mation can only be accomplished for paralogous gene pairs of 
which both copies are still present. If for many duplicates one 
copy was lost over time, the elevated gene frequency of gene 
duplication would be disguised in the K s distributions. In the 
tree reconciliation analysis, the steep decrease in inferred an- 
cient duplications with increasing strictness in duplicate reten- 
tion likely reflects such gene loss. It suggests that for a high 
proportion of ancient duplications, the different species lost 
either one or the other copy over evolutionary time (supple- 
mentary fig. S7, Supplementary Material online). 

The WGD hypothesis ascribes the high number of gene 
duplications in the last common ancestor of Phytophthora to 
a single event. However, this hypothesis demands extensive 
subsequent reshuffling to explain the loss of syntenic regions 
longer than one gene. Furthermore, this reshuffling should 
have occurred in a short time interval before speciation of 
Phytophthora, because extant divergent Phytophthora species 
show extensive interspecies colinearity (Tyler et al. 2006; Haas 
et al. 2009; Lamour et al. 2012). Moreover, even between 
Pythium ultimum and P. infestans (fig. 1), a considerable 
level of gene-order conservation has been reported (Adhikari 
et al. 2013). Therefore, we conclude that a WGD at the last 
common ancestor with subsequent reshufflings is not very 
likely and so far not supported by the available data. 

Outlook 

Even though Phytophthora genomes contain an elevated 
number of 2HOM blocks, timing by duplication reconstruction 
and by synonymous substitutions reveals that these are not 
the result of a genus-shared WGD. Instead, many blocks have 
a lineage-specific origin and/or are located on the same scaf- 
fold. The exponential age distribution of 2HOM block paralogs 
suggests a constant rate of block duplications followed by 
decay that continues to date. Nevertheless, many gene dupli- 
cations occurred in the genome of the last common ancestor 
of Phytophthora, of which only a small amount of paralogs are 
present in extant species. The mechanism responsible for this 
gene duplication burst could in fact be the same as for the 
occurrence of block duplications. A potential source of such 
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small-scale duplications is the activity of TEs, where so-called 
transduplication takes non-TE coding sequences along (Jiang 
et al. 2004). Taking into account proposed roles for TEs in 
speciation and in generating genetic novelty (Rebollo et al. 
2010; Wang et al. 201 1) and the high repeat and TE content 
of many Phytophthora genomes (Tyler et al. 2006; Gijzen 
2009; Haas et al. 2009), TEs might have played a significant 
role in the genome evolution of Phytophthora. 

Materials and Methods 

Complete information regarding all materials and methods 
can be found in the supplementary Materials and Methods, 
Supplementary Material online. 

Genome data of nine oomycete species were downloaded. 
Across these genomes, protein families were defined using 
BlastP and MCL (Altschul et al. 1997; Enright et al. 2002). 
Families containing at least one gene with considerable se- 
quence similarity to known TEs were flagged as TE families 
(Jurka et al. 2005). Each genome was screened for 2HOM 
blocks with post hoc filtering of blocks comprised of large 
protein families and/or TE families. Maximum-likelihood phy- 
logenetic trees were inferred for genes comprising 2HOM 
blocks (Stamatakis 2006). K s was calculated for the transcripts 
of all possible pairs in a protein family using codeml (Yang 
2007). For each protein family, a neighbor-joining tree was 
estimated and automatically reconciled with the species tree 
using NOTUNG (Chen et al. 2000). 

Supplementary Material 

Supplementary tables S1 and S2, figures S1-S7, and materials 
and methods are available at Genome Biology and Evolution 
online (http://ww.gbe.oxfordjournals.org/). 
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